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Abstract. We show that the problem of computing the distance of a given 
permutation from a subgroup H of Sn is in general NP-complete, even under 
the restriction that H is elementary Abelian of exponent 2. The problem 
is shown to be polynomial-time equivalent to a problem related to finding a 
maximal partition of the edges of an Eulerian directed graph into cycles and 
this problem is in turn equivalent to the standard NP-complete problem of 
Boolean satisfiability. 



1. Introduction 

We show that the problem of computing the distance of a given permutation 
from a subgroup H of Sn is in general NP-complete, even under the restriction that 
H is elementary Abelian of exponent 2. The problem is polynomial-time equivalent 
to finding a maximal partition of the edges of an Eulerian directed graph into cycles 
and this is turn equivalent to the standard NP-complete problem 3-SAT. 

2. Distance in the symmetric group 

We define Cayley distance in a symmetric group as the minimum number of 
transpositions which are needed to change one permutation to another by post- 
multiplication 

d{p, tt) = min {n \ pri . . .Tn = n, Ti transpositions } . 

It is well-known that Cayley distance is a metric on 5„ and that it is homogeneous, 
that is, d{p,Tr) — (i(/,p~^7r). Further, the distance of a permutation tt from the 
identity in 5„ is n minus the number of cycles in tt. 

If if is a subgroup of Sn, then we define the distance of a permutation tt from 
H as 

d{H,n) — min(i(77, tt). 

We refer to Critchlow [5] and Diaconis |3| for background and further material 
on the uses of the Cayley and other metrics on Sn- 

Problem 1 (Subgroup-Distance). Instance: Symmetric group Sn, element tt G 
Sn, elements {hi, . . . , hr} of Sn, integer K . 

Question: Is there an element rj ^ H ~ (hi, . . . , hr) such that d{rj,TT) < K? 

The natural measure of this problem is nr where r is the length of the list 
of generators. The following result shows that every subgroup of Sn has a set of 
generators of length at most and hence we are justified in taking n as the measure 
of the various problems derived from Subgroup— Distance. 

Proposition 1. Every subgroup of Sn can be generated by at most elements. 
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Proof. Let _ff be a subgroup of 5„. It is clear that H is generated by the union 
of one Sylow subgroup for every prime p dividing the order ^H. The order of a 
Sylow p-subgroup of H is p'' where p'' divides and hence nl. It is well-known 
(e.g. Dickson ^ I, chap 9) that the power of p dividing n\ is at most so b < n. 
But consideration of the composition factors shows that a p-group of order p'' can 
be generated by a set of at most b elements: hence any Sylow subgroup of H can 
be generated by at most n elements. Further, the set of prime factors of the order 
of H forms a subset of the set of prime factors of nl, that is, of the primes up to 
n, and there are at most n such primes. Hence H can be generated by a set of at 
most elements. 

□ 

Although we do not need the stronger result, it can be shown that any subgroup 
of S„ can be generated by at most 3n — 2 elements. 

We define a subset of S„ to be involutions with disjoint support (IDS) to be set 

of elements of the form jj = (^a^j^' Uj^''^ ■ ■ ■ Vj^'^^ where the are all 

distinct. The subgroup generated by an IDS is clearly elementary Abelian with 
exponent 2. Define the width of an IDS to be the maximum number of 2-cycles 
rj in the generators 7j. The problem IDSw— Subgroup— Distance is the problem 
Subgroup— Distance with the list of generators restricted to be an IDS of width 
at most w. 

Theorem 2. The problem IDS6— Subgroup— Distance is NP-complete. 

The Theorem will follow from combining Theorem 3 and Theorem 7. We de- 
duce immediately that the more general problem Subgroup— Distance is also NP- 
complete. 

By contrast, the problem of deciding whether the distance is zero, that is, testing 
for membership of a subgroup of Sn, has a polynomial-time solution, an algorithm 
first given by Sims and shown to have a polynomial-time variant by Furst, 
Hopcroft and Luks See Babai, Luks and Seress ^ and Kantor and Luks 7 for 
a survey of related results. 

3. Switching circuits 

Let G ~ [V, E) be a directed graph with vertex set V and edge set E. (We allow 
loops and multiple edges.) For each vertex v define e+{v) to be the set of edges out 
of V and e_(w) the set of edges into v. The in- valency d-{v) = #e_(w) and the out- 
valency d+{v) = #e+(w). We define a switching circuit to be a directed graph G for 
which d+{v) — d-{v) = d{v), say, and for which there is a labelling l±{v) of each set 
e±{v) with the integers from 1 to d{v). (The labels at each end of an edge are not 
related.) A routing p for a switching circuit is a choice of permutation p{v) g 5'g(„) 
for each vertex v. Clearly there is a correspondence between routings for a switching 
circuit G and decompositions of the edge set of G into directed cycles. We define 
a polarisation T for a switching circuit G to be an equivalence relation on the set 
of vertices such that equivalent vertices have the same valency, and call (G, T) a 
polarised switching circuit. We say that a routing p respects the polarisation T if 
the permutations p{x) and p{y) are equal whenever x and y are equivalent vertices 
under T. We shall sometimes refer to a switching circuit without a polarisation, or 
with a polarisation for which all the classes are trivial, as unpolarised. 

Problem 2 (Polarised-Switching-Circuit-Maximal-Routing). Instance: Polarised 
switching circuit {G,T), positive integer K. 

Question: Is there a routing which respects T and has at least K cycles in the 
associated edge-set decomposition? 
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We define the width of a polarisation to be the maximum number of vertices in an 
equivalence class of T. The problem Widthui—Valencyii—MEiximal— Routing is 
the problem Polarised Switching Circuit Maximal Routing with the width 
of T constrained to be at most w and the in- and out- valency of each vertex in V 
constrained to be at most v. 

Theorem 3. Problem Width6 Valency2 Maximal Routing is NP- complete. 

4. Proof of Theorem 3 

We shall show that the problem 3-SAT, [L02] of Garey and Johnson 6 , which 
is known to be NP-complete, can be reduced to the problem Width6— Valency2— 
MfLximal Routing. 

We define a polarised switching circuit (G, T) to be Boolean (or binary) if every 
vertex has in- and out-valency 1 or 2. To each class C of the polarisation T we asso- 
ciate a Boolean variable a(C). There is then a 1-1 correspondence between routings 
p which respect T and assignments of truth values to the variables a(C), C G T by 
specifying that o(C) is (false) if and only if the permutation p{v) is the identity 
in 5*2 for every in C, and 1 (true) if and only if p{v) = (1 2). 

We denote a vertex in a polarisation class associated with the Boolean variable 
a as in Figure 1. Out convention for drawing the diagrams will be to assume the 
edges round each vertex labelled so that 1 is denoted by either "straight through" 
or "turn right" . 




Figure 1. A vertex in a switching circuit associated with the 
Boolean variable a, and the routings with a = 1 and a = re- 
spectively 

We associate a vertex with the negated variable a by exchanging the input labels 
1 and 2. 

Our proof will proceed by finding polarised switching circuits for which the num- 
ber of maximal cycles in a routing is a Boolean function of the variables. 

For a single Boolean variable a define /(a) to be the switching circuit in Figure 

2. 




Figure 2. The switching circuit I {a). 
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For a pair of Boolean variables (a, b) define the polarised switching circuit E{a, b) 
as in Figure 3. 




Figure 3. The switching circuit E{a,b). 
Further define the polarised switching circuit F{a, b) as in Figure 4. 




Figure 4. The switching circuit F{a,b). 

Define G{a, b) to be the disjoint union of F{a, b) and E{a, b). 

Proposition 4. (1) The number of cycles in a routing for I{a) is 2 if a = 1 
and otherwise 1. 

(2) The number of cycles in a routing for E{a, b) is 2 if a = b and otherwise 1. 

(3) The number of cycles in a routing for F{a, b) is 2 if a ^ b, 3 if a = b = 1 
and 1 if a = b = 0. 

(4) The number of cycles in a routing for G{a, b) is 2 if a = b = and 4 
otherwise. 

Proof. In each case we simply enumerate the cases. 

□ 

For a triple of Boolean variables (a, b, c) define the polarised switching circuit 
A{a, b, c) as in Figure 5. 

Proposition 5. The number of cycles in a routing for A{a, b,c) is 1 ifa = b = c = 
and 3 otherwise. 

Proof. Again, in each case we simply enumerate the cases. 

□ 

Theorem 6. There is a polynomial-time parsimonious transformation from the 
problem 3-SAT to the problem Width6—Valency2— Maximal— Routing. 

Proof. Suppose we have an instance of 3-SAT: that is, a Boolean formula $ of 

length I in variables Xi which is a conjunct of k clauses each of which is a disjunct 
of at most three variables (possibly negated). We transform $ into a formula 
in variables yj by replacing the j*^ occurence of variable Xj by the variable y^ and 
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Figure 5. The switching circuit A{a,b,c). 



conjoining clauses (yl = yf) A ... A [y^ = j where the variable Xi occurs 

Vi times in Clearly $ and represent the same Boolean function and have the 
same number of satisfying assignments. Every variable in occurs at most three 
times, and at most once in a disjunct deriving from a clause in Let n be the 
total number of variables in certainly n < I. 

Wc form a polarised switching circuit ^' from <!>' as follows. Take a circuit 
B{x, y, z) for every clause in $' of the form (x V y V z); take a circuit G{x, y) for 
every clause in of the form {x \/ y); take a circuit I{x) for every clause in of 
the form {x); take a circuit E(x,y) for every clause in $' of the form [x = y). Let 
the number of circuits of types B, G, I and E taken to form "if be b, g, i, and e 
respectively. Put M = 3b + + 2i + 2e. The resulting polarised switching circuit 
has n classes, and each class in the polarisation is involved in at most one circuit of 
the form B, G or /: hence each class contains at most 4+1 + 1 = 6 vertices and the 
number of vertices in ^ is thus at most 6n. Furthermore, a routing for ^ has M 
cycles if and only if the corresponding assignment of Boolean values gives and 
hence the value 1; otherwise a routing has less than M cycles. 

□ 

Since the problem 3-SAT is known to be NP-complete, we immediately de- 
duce that the problem Widthl6—Valency2— Maximal— Routing is NP-complete 
as well. This proves Theorem 3. 

5. Switching circuits and IDS 

In this section we obtain a polynomial-time equivalence between the problems 
Widthw—Valency2— Maximal— Routing and IDSw— Subgroup— Distcince. 

Theorem 7. There is a polynomial-time parsimonious equivalence between prob- 
lems Widthw—Valency2— Maximal— Routing andlDSw— Subgroup— Distance. 

Proof. Suppose wc have an instance of IDS?i7— Subgroup— Distance, that is, an 
element tt of Sn together with an IDS on a set of t generators {7^} with 7^ = 

((1) / (r) (r)\ (i) (i) 

j y) I [ j y) I ' ^^"^ j 'Vj distinct and all the Vj < w. We con- 
struct a polarised switching circuit on a graph, vertex set V = {^'(1), • • • , P{n)} U 
{Q{1,1), . . . ,Q{t,rt)}. Each vertex P{k) will bo of in-valence and out-valencc 1: 
each vertex Q{j,i) will be of in-valence and out- valence 2. For each j up to t 
and i up to rj we take edges from P{Xj) and from P(y]) to Q{j,i) labelled 1 and 
2 respectively, and edges from Q{j,i) to P [tt (xj)) and to P(7r(?/*)) again la- 
belled 1 and 2 respectively. We define a polarisation T on V by taking t classes 
Cj = {Q{j, i) I i = 1, . . . , Tj}; clearly the width of T is at most w. 
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Conversely, suppose we have an instance of Widtht«—Valency2— Maximal- 
Routing, that is, a directed graph {V, E) with every vertex v having in- and out- 
valency two, a labelling l±{v) : e±{v) — > {1,2} of edges into and out of each vertex 
V, and an equivalence relation T on y with t classes each of size at most w. Put 
n = We define a permutation tt of _E as follows. For an edge e into a vertex 
V, let 7r(e) be the edge / out of v which has label l+{v){f ) equal to l-{v){e). We 
further define an IDS by writing down a set of generators {7^} as follows. For 

each class of vertices Cj = ^vj | i = 1, ... ,rj| in the polarisation T, let 7^ be the 
product of transpositions of the form {^flgl^ where // and gl are the edges out 

of vertex vl- Since each class in T has at most w elements, each generator 7^ is 
composed of at most w transpositions. 

In each case there is a correspondence between routings p of the switching circuit 
which respect the polarisation T and permutations of the form -kt] where r] runs over 
the elements of the subgroup H of Sn generated by the 7j: in this correspondence the 
number of cycles in the routing p is equal to the number of cycles in the permutation 
-KT]. Hence tt is within distance d of the group generated by the 7^ if and only if 
there is a routing p with at least n — d cycles. 

□ 
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